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Abstract 

We introduce a heterogeneous connection model for network formation to capture 
the effect of cost heterogeneity on the structnre of efficient networks. In the proposed 
model, connection costs are assumed to be separable, which means the total connection 
cost for each agent is uniquely proportional to its degree. For these sets of networks, we 
provide the analytical solntion for the efficient network as a function of connection costs 
and benefits. We show that the efficient network exhibits a core-periphery structure. 
Moreover, for a given link density, we find a lower bound for the clustering coefficient 
of the efficient network, and compare it to that of the Erdos-Renyi random networks. 
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1 Introduction 


Network formation models are increasingly being used in a variety of economic contexts 
and other multi-agent systems. These models often study the structural conditions of effi¬ 
ciency, network social welfare, and stability, which is a measure of individual incentives to 


form, keep or sever links (Jackson et ah (2008)). 


We build our model based on the Connection model proposed by Jackson and Wolinsky 


(1996 ^ in which agents can beneht from both direct and indirect connections, but only 


pay for their direct connections. Benehts of indirect connections generally decrease with 
distance. Jackson and Wolinsky ( |1996 ) demonstrated that, for the homogeneous case, the 
efficient network can only take one of three forms: a complete graph, a star or an empty 
graph depending on connection cost and benehts. Several models have been proposed to 


introduce heterogeneity into the connection model, (see for instance: Galeotti et al. (2006); 


Jackson and Rogers (2005); Persitz (2010); Vandenbossche and Demuynck (2013)); the focus 


has mainly been on conditions for stability, with few references to efficiency. Finding general 
analytical solutions for the efficient networks with heterogeneous costs can be intractable, 
see for example Carayol and Roux ( 2009| ). 

Here, we focus on hnding efficient networks for a particular model of cost heterogeneity 
that we refer to as the separable connection cost model, in which shares of nodes’ costs from 
each connection are heterogeneous, yet hxed and independent of to whom they connect. 
This is motivated by networks in which heterogeneous agents are each endowed with some 
resources (time, energy, bandwidth, etc) and the total resource needed to establish and 
maintain connections for each node can be approximated to be proportional to its degree. 
We further assume homogeneous benehts decaying with distance. We provide an exact 
analytical solution for efficient connectivity structures under these assumptions and show 


Gackson and Wolinsky (1996) developed their model based on the notion of pairwise stability or two¬ 


sided link formation where a link is formed upon the “mutual consent” of two agents. There is also another 


line of literature from Bala and Goyal (1997); Goyal (1993) that studies one-sided and non-cooperative link 


formation, where agents unilaterally decide to form the links with another agent. 


2 











































that such networks have at most one connected component, exhibit a core-periphery structure 
and have diameters no larger than two. We further provide a lower bound for the clustering 
coefficient and discuss the pairwise stability implications of efficient networks. 


2 Model 


For a hnite set of agents N = {1,..., 6}, let b : {1,..., n — 1} —)■ M represent the beneht 
that an agent receives from (direct or indirect) connections to other agents as function of the 


distance between them in a graph. Following Jackson and Wolinsky (1996), the (distance- 
based) utility function of each node, Ui{g), in a graph g and the total utility of the graph, 
U{g), are as follows: 


Ma) = Kdijia)) - 

n ^ ^ 

= '^Ui{a) 

i=l 

where Ni{g) is the set of nodes to which i is linked, and Nf{g) is the set of nodes that 
are path connected to i by a distance no larger than k. dij{g) is the distance between i and 
j, Cij is the cost that node i pays for connecting to j, and b is the beneht that node i receives 
from a connection with another node in the network. We assume that b{k) > b{k J- 1) > 0 
for any integer k > 1. 

Let complete graph g^ denotes the set of all subsets of N of size 2. The network g is 
efficient, if U{g) > U{g') for all g' C g^, which indicates that g = argmax^ Mj( 5 f). 
Assuming the separable cost model as introduced earlier, connection costs in Equation for 
a link between i and j can be written as c^- = c,, Cji = Cj. We then introduce a connection 
cost vector, C, and without loss of generality rename nodes such that ci < C 2 < ■■■ < Cn- 
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Figure 1: Structure of the efficient network with separable heterogeneous connection model. 
The connected component has a generalized star structure, n is the number of nodes in 
the network, m is the number of nodes in the connected component, and k is the number 
of nodes in the core (complete subgraph). Nodes are sorted according to their costs i.e. 
Cl < C2 < ■ ■ ■ < c„. 

2.1 Efficient Structures under Separable Connection Costs 

In Lemma [T] we determine the efficient structure for a connected component. Then in 
Proposition we determine the structure of the efficient network in general. 

Lemma 1 If the efficient network with separable cost model is connected then it has a “gen¬ 
eralized star” structure with the following characteristics: (a) All nodes are connected to 
node 1 ( the node with minimum connection cost), (b) Nodes i and j (i,j 7 ^ 1 j are connected 
- &( 2 ) > .5(ci + Cj). 

Proof. Let N represents nodes in the connected network. If there exists a subset of nodes 
M = {ui, ...,nm} [M C N) that are not connected to node 1, we show that the network is 
not efficient. Since N is connected, there exists a set of links, L = where U is 

adjacent to Suppose /j connects n* to tc, ( tCj 7 ^ 1 by dehnition). Now, if we remove all 
/jS and connect all UjS to node 1 , we have reduced the total connection cost of the network 

m 

by mci - < 0 - 

k=l 

Now, to address the benehts, note that we have not changed the number of links; there¬ 
fore direct benehts remain the same. Furthermore, the diameter of the new network is 2. 

^For any subset M = {iii, • • • ,Vm} of a connected network N (M ^ N), we can show that there are 
links L = {h, • • • , Im} in N such that Vi is adjacent to h- 
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Therefore every distance that is not 1 is capped at 2, making the total beneht larger than 
that of the original network. This resnlts in at improvement in the total ntility, indicating 
that the original network was not efficient. 

Fnrthermore, having established that the maximnm distance in the efficient network is 
no larger than two, every node i and j {i,j ^ 1 ) are connected iif 6 ( 1 ) — 6 ( 2 ) > .5(ci + Cj). 

■ 

Proposition [^determines the structnre of the efficient network and shows that the efficient 
network is a spectrnm of solntions. 

Proposition 1 In the connection model, for a finite set of agents, N = {1, ..,n}, if Cij = Cj 
for all i,j G N, where Ci E C = {ci, C 2 ,..., c„} and assuming, ci < C 2 < • • • < the 
structure of the efficient network is as follows: Let m he the largest integer between 1 and n 
such that 26(1) + 2 (m — 2 ) 6 ( 2 ) > (c^ + Ci). If i > m, then i is isolated. If i < m, then there 
is exactly one link between i and 1; also there is one link between i and j (I < i,j < m) iff 
6 ( 1 ) - 6 ( 2 ) > .5(ci + Cj). 

Proof. 

First, we show there is at most one connected component in the efficient network. Next, 
we hnd the condition for each node to be in the connected component, which has the gener¬ 
alized star strnctnre according to Lemma [Tj 

Assnme that the efficient network has more than one (e.g. two) connected components 
with {mi,£i) being respectively the nnmber of nodes and links in connected component 
i. According to Lemma 1, each connected component has a generalized star strnctnre. 
The total beneht of each component is Bi = 2iib{l) + {mfimi — 1) — 2f'i)6(2) and B 2 = 
2 £ 2 b{l) + {m 2 {m 2 — l) — 2 £ 2 )b{ 2 ) respectively. Snppose h and h' are nodes with minimnm costs 
in component 1 and 2 respectively and withont loss of generality Ch < Ch'. If we disconnect all 
links connected to h' and connect them directly to h, total cost decreases by (ch'—Ch) per link. 
This also results in the total beneht B = {£i+£ 2 )b{l) + {{mi+m 2 ){mi+m 2 — l)—2{£i+£2))b{2), 
which is strictly greater than {Bi + B 2 ). 
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To determine which nodes belong to the connected component, Gc, in the efficient net¬ 
work, we dehne, for node i, Ai = 26(1) -|- 2{k — 2 ) 6 ( 2 ) — ci — q where k is the number of 
nodes in the connected component Gc- We show that i is in Gc iff Ai > 0. First, A* > 0 is 
the sufficient condition for i to be in Gc- This is because connecting i to node 1 increases 
the total utility by exactly Ai, as the diameter of Gc is at most 2, according to Lemma 1. 
Also if Aj < 0, then i will be isolated so Aj > 0 is also the necessary condition. This is 
because i cannot be only connected to 1 since Aj < 0 , so the only way for i to be connected 
is by having more than one link. From Lemma for i to have a link to j 7 ^ 1, we must have 
Cj -1- Cj < 26(1) — 26(2). But: c* -1- Cj > Ci + Ci > 26(1) -t- 2{k — 2)6(2) > 26(1) — 26(2), where 
we use the fact that Cj > ci and Aj < 0 , so z cannot have more than one connection either, 
thus i will be isolated. Note that A* > 0 also means Aj > 0 for all j < i since Cj < Ci, thus 
all lower cost nodes will also be in Gc, so the smallest i for which Aj < 0 provides the size 
of the connected component in the efficient network. 


A typical structure for the efficient network with heterogeneous separable cost model is 
illustrated in Figure [TJ 


2.2 Characteristics of Networks with Separable Cost Model 

2.2.1 Core-Periphery structure 


We show that the efficient network has a Core-periphery structure, a widely observed 


structure in various social and economic networks (i.e. see for example Zhang et ah (2014); 


Rombach et ah (2014)). We adopt the formal dehnition from Bramoulle (2007), which 


states that a graph g has a core-periphery structure when agents can be partitioned into 
two sets, the core G and the periphery P, such that all partnerships are formed within the 
core and no partnership is formed within the periphery. For an efficient network, let k be 
the largest integer between 2 and n such that 6(1) — 6(2) > .5(cfc_i - 1 - c^). The efficient 
network can be partitioned into a set C = { 1 ,..., fc}, which forms a complete subgraph and 
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a set P = {k + 1,... ,n} which can only have connections to the complete subgraph. If 
6(1) — 6(2) > (cfc_i + Cfc) then k and k — 1 are connected and there is also a link between 
every node i,j{i,j < k and q, Cj < Ck), which forms a complete subgraph. Similarly, we can 
show that for every i E P, which is connected to a node j, Cj < Ck and j ^ P. 

2.2.2 Clustering coefficient 

Unlike efficient networks with homogeneous cost model whose clustering coefficients are 
either one (complete graph) or zero (star structure or empty graph), the clustering coefficient 
of the efficient network with heterogeneous costs covers a wide range. To find a lower bound, 
we find the minimum global clustering coefficient of all the efficient networks with a given 
link density and various connection cost values. To this end, we construct a network that 
does not break the efficiency condition, provided in the previous section, while producing 
the minimum possible number of triangles for the given density as follows: Starting from an 
empty graph and node k = 1, we only establish links from node k to every node i > k in 
ascending order. We repeat this process for k = 1,... ,p + 1 until the total number of links 
reaches i. p is the largest integer such that i = X]fc=i(^ — k) + J, where J is the residual 
number of links in the p +1 round of incomplete iteration (J < n—p+1). At the /c-th iteration, 
the total number of connected triplets is increased by ("^ 2 ^) + 2{k — l){n — k) + ( 2 ) + 2Jp. The 
first term in this equation results from the fact that node k will act as a local hub with [n — k) 
new links who provide new triplets. The second term is the additional triplets resulting 
from new triangles formed in the k-th round since there are {k — 1 ) (n — k) triangles formed in 
round k which add 3 triplets, one of which have already been counted in previous iterations. 
The remaining two terms follow a similar logic for the residual links (J). Similarly, the total 
number of triangles formed at /c-th iteration is increased by {k — l){n — k) + pJ. Therefore, 
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for a given number of links {£), we have: 


^min 

^eff 


{9 A) = 


3 X number of triangles 


number of connected triplets of nodes 
3x - l){n - k) + pJ} 

ELi{(V) + 2(fc - l)(n - k)} + 0 + 2Jp 


( 2 ) 


Compared to an Erdos-Renyi (ER) random network with the same link density, some algebra 
reveals that the clustering clustering coefficient of the efficient heterogeneous network exceeds 
that of an ER network when then number of nodes is greater than 10 and the link density 
is larger than For sufficiently large networks, the condition simplifies to having link 

density greater than 


3 Conclusion 

Heterogeneous yet separable connection costs cover important classes of real networks. 
We showed that efficient structures for such networks can be solved exactly and have diameter 
no larger than two; we also discussed the transitivity and core-periphery nature of such 
networks. Although benefits are still assumed to be homogeneous, one can easily take into 
account heterogeneity of direct benefits, as long as the separability assumption is maintained, 
i.e. cost and direct benefit terms appear together in all analysis and cost terms can capture 
heterogeneity of direct benefits by embedding them as an off-set in the fixed costs of nodes. 

We mainly focused on the notion of efficiency for the networks with separable connection 
cost model. Further studies can investigate stability for the proposed model, and find con¬ 
ditions under which stable and efficient structures coincide. Moreover, there are cases where 
the separable cost assumption does not hold, for example when the link cost is the function 
of how similar the two nodes are. These cases in general can be intractable and approximate 


methods such as island models as discussed in Jackson and Rogers (2005) can be used. 
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